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Abstract 

We show that the unrestricted black-box complexity of the n-dimensional XOR- and 
permutation-invariant LeadingOnes function class is 0(nlog(n)/loglogrt). This shows 

■ that the recent natural looking 0(n log n) bound is not tight. 
The black-box optimization algorithm leading to this bound can be implemented in a 

way that only 3-ary unbiased variation operators are used. Hence our bound is also valid for 
the unbiased black-box complexity recently introduced by Lehre and Witt. The bound also 

■ remains valid if we impose the additional restriction that the black-box algorithm does not 
, have access to the objective values but only to their relative order (ranking-based black-box 

complexity) . 
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The black-box complexity of a set J- of functions S — > WL, roughly speaking, is the number 
■ of function evaluations necessary to find the maximum of any member of J- which — apart 

Vj^ . from the points evaluated so far — is unknown. This and related notions are used to describe 

how difficult a problem is to be solved via general-purpose (randomized) search heuristics. 
Consequently, black-box complexities are very general lower bounds which are valid for a wide 
range of evolutionary algorithms. A number of different black-box notions exist, each capturing 
| different classes of randomized search heuristics, cf. |D JW06| . |LW10j . and |DWllj . 

In this paper we consider the unrestricted black-box model by Droste, Jansen, and We- 
gener }DJW06j and the unbiased black-box model by Lehre and Witt [LWlOj . and we shortly 
remark on the ranking-based models which we propose in |DW11| . A formal definition of the 
first two models is given in Section [2J For now, let us just mention that algorithms in the 
unrestricted black-box model are allowed to query any bit string, whereas in the A;-ary unbiased 
model, an algorithm may only query search points sampled from a fc-ary unbiased distribution. 
That is, in each iteration, the algorithm may either query a random search point or, based upon 
at most k previously queried search points, it may generate a new one. The new search point 
can be generated only by using so-called unbiased variation operators. These are operators that 
are symmetric both in the bit values (0 or 1) and the bit positions. More formally, the variation 
operation must be invariant under all automorphisms of the hypercube. 

In this work, we are concerned with the black-box complexity of the LeadingOnes func- 
tion, which is one of the classical test functions for analyzing the optimization behavior of 
different search heuristics. The function itself is defined via Lo : {0, l} n —> [0..n],x i— > max{i £ 
[0..n] | Vj < i : Xj = 1}. It was introduced in [Rud97] to disprove a previous conjecture by 
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Miihlenbcin [Miih92| that any unimodal function can be optimized by the well-known (1+1) evo- 
lutionary algorithm (EA) in O(nlogn) iterations. Rudolph [Rud97| proves an upper bound of 
0(n 2 ) for the expected optimization time of the (1 + 1) EA on Lo and concludes from ex- 
perimental studies a lower bound of Q(n 2 ) — a bound which was rigorously proven in 2002 by 
Droste, Jansen, and Wegener [D JW02] . This 0(n 2 ) expected optimization time of the simple 
(1 + 1) EA seems optimal among the commonly studied evolutionary algorithms. 

Note that the unrestricted black-box complexity of the Lo function is 1: The algorithm 
querying the all-ones vector (1, . . . , 1) in the first query is optimal. Motivated by this and by 
the fact that the unbiased black-box model only allows variation operators which are invariant 
with respect to the bit values and the bit positions, we shall study here a generalized version of 
the Lo function. More precisely, we consider the closure of Lo under all permutations a G S n 
and under all exchanges of the bit values and 1. It is immediate, that each of these functions 
has a fitness landscape that is isomorphic to the one induced by Lo. To be more precise, we 
define for any bit string z G {0, l} n and any permutation a of [n] the function 

Lo ZjCT : {0, l} n -> [0..n},x i-> maxji G [0..n] | V j < i : = x a ^} . 

We let LeadingOnes„ be the set {Lo ZjCT | z G {0, l} n ,cx G S n } of all such functions. 

Note that this definition differs from the one in [DJW06J, where only the subclass 
LeadingOnes^ := {Lo 2j id | z G {0, l} n } not invariant under permutations of [re] is stud- 
ied. Here, id denotes the identity mapping on [n]. For this restricted subclass, Droste, Jansen, 
and Wegener |DJW06| prove an unrestricted black-box complexity of ©(n). Of course, their 
lower bound O(n) is a lower bound for the unrestricted black-box complexity of the general 
LEADiNGONES ra function class, and consequently, a lower bound for the unbiased black-box 
complexity of LEADlNGONES n . 

The function class LeadingOnes„ has implicitly been studied for the first time in [LWlOj . 
where Lehre and Witt show that indeed the (1+1) EA is provably (asymptotically) optimal 
among all unbiased black-box algorithms of arity at most one. This establishes a natural 0(n 2 ) 
bound for LeadingOnes„. 

Surprisingly, it turns out that this bound does not hold for the unrestricted black-box model 
and that it does not even hold in the 2-ary unbiased black-box model. In [D JK + li] it is shown 
that, assuming knowledge on a(l), . . . ,a(£), one can perform a binary search to determine 
a {I + 1) and its corresponding bit value. Since this has to be done n times, an upper bound of 
0(n log n) for the unrestricted black-box complexity of LEADiNGONES n follows. Furthermore, 
this 0(n log n) bound can already be achieved in the binary unbiased model. Up to now, this is 
the best known upper bound for the unrestricted and the 2-ary unbiased black-box complexity 
of LeadingOnes„. 

In this work we show that both in the unrestricted model (Section [3|) and for arities at 
least three (Section H]), one can do better, namely that 0(nlog(n)/loglogn) queries suffice to 
optimize any function in LEADiNGONES n . This breaks the previous 0(n log n) barrier. This 
result also shows why previous attempts to prove an fi(nlogn) lower bound must fail. 

Unfortunately, also the ranking-based model does not help to overcome this unnatural low 
black-box complexity. We shall comment in Section [5] that the 3-ary unbiased ranking-based 
black-box complexity of LeadingOneS™, too, is 0(nlog(n)/loglogn). 

As for the memory-restricted model we note without proof that a memory of size 0(\/log re) 
suffices to achieve the same bound. 
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Algorithm 1: Scheme of an unrestricted black-box algorithm for optimizing / : S — > R 

1 Initialization: 

2 Sample x^ according to some probability distribution on S; 

3 Query /(x^); 

4 Optimization: 

5 for t = 1,2,3,... do 

6 Depending on ((x^°\ f(x^)), . . . , /(x^ -1 ^))) choose a probability 
distribution pW on 5; 

7 Sample according to p(*); 

8 Query f(x^); 



2 Preliminaries 

In this section we briefly introduce the two black-box models considered in this work, the 
unrestricted black-box model by Droste, Jansen, and Wegener [DJW06J and the unbiased black- 
box model by Lehre and Witt |LW10] . Due to space limitations, we keep the presentation as 
concise as possible. For a more detailed discussion of the two different black-box models and 
for the definition of the ranking-based versions considered in Section [5j we refer the reader 
to [DWllj . 

Before we introduce the two black-box models, let us fix some notation. For all positive 
integers k G N we abbreviate [k] := {1, . . . , k} and [0..k] := [k] U {0}. By e£ we denote the k-th 
unit vector (0, . . . , 0, 1, 0, . . . , 0) of length n. For a set / C [n] we abbreviate e" := Yliel e ^ = 
ffiie/e", where © denotes the bitwise exclusive-or. By S n we denote the set of all permutations 
of [n] and for x = (x±, . . . ,x n ) G {0, l} n and a G S n we abbreviate a(x) := (x a n\, . . . ,x a r n \). 
For any two strings x,y G {0, l} n let B(x,y) := {i G [n] \ Xi = yi}, the set of positions in which 
x and y coincide. 

For r G M>o, let [~r] := min{n G No | n > r} and [ r J := max{n G No | n < r}. For 
the purpose of readability we sometime omit the [•] signs, that is, whenever we write r where 
an integer is required, we implicitly mean \r \ . All logarithms log in this work are base two 
logarithms. By In we denote the logarithm to the base e := exp(l). 

Black-Box Complexity. Let A be a class of algorithms and J 7 be a class of functions. For 
every A £ A and / G T let T(A, /) G R U {oo} be the expected number of fitness evaluations 
until A queries for the first time some x G argmax/. We call T(A, f) the expected optimization 
time of A for f. The A-black-box complexity of T is T(A, F) := supj g jrT(A, /), the worst-case 
expected optimization time of A on J- . The A-black-box complexity of T is inf/ig^ T(A, the 
best worst-case expected optimization time an algorithm of A can exhibit on T . 

The Unrestricted Black-Box Model. The black-box complexity of a class of functions 
depends crucially on the class of algorithms under consideration. If the class A contains all 
(deterministic and randomized) algorithms, we refer to the respective complexity as the unre- 
stricted black-box complexity. This is the model by Droste, Jansen, and Wegener |D JW06| . The 
scheme of an unrestricted algorithm is presented in Algorithm [TJ 

Note that this algorithm runs forever. Since our performance measure is the expected 
number of iterations needed until for the first time an optimal search point is queried, we do 
not specify a termination criterion for black-box algorithms here. 

The Unbiased Black-Box Model. As observed already by Droste, Jansen, and We- 
gener [D JW06j , the unrestricted black-box complexity can be surprisingly low for different 
function classes. For example, it is shown in [DJW06 that the unrestricted black-box complex- 
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Algorithm 2: Scheme of a /c-ary unbiased black-box algorithm 



1 Initialization: 

2 Sample x^ ' G {0, l} n uniformly at random; 

3 Query /(x^); 

4 Optimization: 

5 for t = 1,2,3,... do 

6 Depending on (f(x^), . . . , /(x^ -1 ^)) choose up to k indices i%, . . . , G [0..t — 1] 
and a k-ary unbiased distribution D{- \ x^ n \ . . . ,x^ ih '); 

7 Sample according to D{- \ x^ n \ . . . ,x^ k ^); 

8 Query f(x®); 



ity of the NP-hard optimization problem MaxClique is polynomial. 

This motivated Lehre and Witt [LW10] to define a more restrictive class of algorithms, the 
so-called unbiased black-box model, where algorithms may generate new solution candidates only 
from random or previously generated search points and only by using unbiased variation opera- 
tors, cf. Definition [TJ Still the model captures most of the commonly studied search heuristics, 
such as many (/i + A) and (fj,, A) evolutionary algorithms, simulated annealing algorithms, the 
Metropolis algorithm, and the Randomized Local Search algorithm (confer the book [ADllj for 
the definitions of these algorithms). 

Definition 1 (Unbiased Variation Operator). For all k G N, a k-ary unbiased distribution 
[D{- 1 ■ ■ ■ , y^)) y (i) y (fc)g{oi}" is a family of probability distributions over {0,l} n such that 
for all inputs y^\ ■ ■ ■ ,y^ 6 {0, l} n the following two conditions hold. 

(i)Vx,ze {0,l} n : D(x | y (1) ,...,7/ (fe) ) =D(x®z \ yW z, . . . ,yW © z ) ; 
(«) Vx G {0, 1}" Va G S n : D(x \ . . . ,y^) = D(a(x) | a(y^), . . . ,a(y«)) . 

We refer to the first condition as ©-invariance and to the second as permutation invariance. 
An operator sampling from a k-ary unbiased distribution is called a k-ary unbiased variation 
operator. 

1-ary — also called unary — operators are sometimes referred to as mutation operators, in 
particular in the field of evolutionary computation. 2-ary — also called binary — operators are 
often referred to as crossover operators. If we allow arbitrary arity, we call the corresponding 
model the *-ary unbiased black-box model. 

A fc-ary unbiased black-box algorithm can now be described via the scheme of Algorithm [21 
The k-ary unbiased black-box complexity of some class of functions J- is the complexity of T 
with respect to all fe-ary unbiased black-box algorithms. 

3 On LeadingOnes n in the Unrestricted Model 

This section is devoted to the main contribution of this work, Theorem [TJ Recall from the 
introduction that we have defined 

Lo Zj(J : {0, l} n — > [0..n], x i— > maxji G [0..n] | Vj < i : z a u\ = x a u\} 

and LeadingOnes„ := {Lo z ^ \ z G {0, l} n , a G S n } . 

Theorem 1. The unrestricted black-box complexity of LEADINGONES n is 0(nlog(n)/loglogn). 
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The proof of Theorem [T] is technical. For this reason, we split it into several lemmata. The 
main proof can be found at the end of this section. We remark already here that the algorithm 
certifying Theorem Q] will make use of unbiased variation operators only. Hence, it also proves 
that the *-ary unbiased black-box complexity of LeadingOnes^ is 0(relog(re)/loglogn). This 
will be improved in Section [H 

The main idea of both the *-ary and the 3-ary algorithm is the following. Given a bit string x 
of fitness Lo Z)(T (x) = £, we iteratively first learn k := \/log n bit positions o(£ + 1), . . . , o(£ + k) 
and their corresponding bit values which we fix for all further iterations of the algorithm. 
Learning such a block of size k will require 0(k 3 / log k 2 ) queries. Since we have to optimize 
n/k such blocks, the overall expected optimization is 0(nk 2 / log k 2 ) = 0(nlog(n)/loglogn). 
In what follows, we shall formalize this idea. 

Convention: For all following statements let us fix a positive integer re, a bit string z G 
{0, l} n and a permutation a G S n . 

Definition 2 (Encoding Pairs). Let £ G [0..n] and let y G {0, l} n with Lo 2j(T (y) = I. If 
x G {0,1}™ satisfies Lo 2iCr (x) > Lo 2)(T (y) and I = \{i G [n] \ xi = yi}\ , we call (x,y) an 
^-encoding pair for Lo 2jCr . 

// (x,y) is an l-encoding pair for Lo 2)(T; the bit positions B(x,y) are called the ^-encoding 
bit positions of Lo Zi(7 and the bit positions j G [n]\B(x,y) are called non-encoding. 

If (x, y) is an ^-encoding pair for Lo 2jCr we clearly either have have Lo 2iCr (x) > £ or Lo 2>cr (x) = 
Lo 2i<J (?/) = re. For each non-optimal y G {0, l} n we call the unique bit position which needs 
to be flipped in y in order to increase the objective value of y the £-critical bit position of 
Lo 2jCr . Clearly, the ^-critical bit position of Lo 2jCr equals a{£+ 1), but since a is unknown to the 
algorithm we shall make use of this knowledge only in the analysis, and not in the definition 
of our algorithms. In the same spirit, we call the k bit positions a(£ + 1), . . . , a(£ + k) the k 
£-critical bit positions o/Lo 2j(7 . 

Lemma 2. Let £ G [0..n — 1] and let (x, y) G {0, 1}" x {0, 1}" be an £-encoding pair for Lo 2i(7 . 
Furthermore, let k G [n — Lo 2i(T (y)] and let y' G {0, 1}™ with £ < Lo 2iCr (y') < £ + k. 

If we create y" from y' by flipping each non-encoding bit position j G [n]\B{x,y) with 
probability 1/k, then 

Pr[Lo Z)a (y") > Lo z>rT {y')} > (eky 1 . 

Proof. First note that due to Lo 2j(J (?/ / ) > Lo 2jCr (y), we clearly have y'j = Xj for all ^-encoding 
bit positions j G B(x,y). Since we do not allow these bit positions B(x,y) to be changed, we 
necessarily also have Lo 2i(7 (y") > £. 

Let c G [0. Vlog n\ such that Lo 2jCT (y') = £ + c. Then Lo 2j(J (y") > Lo 2j(J (y / ) if and only if 
both 

(i) none of the c bit positions cr(Lo 2jCr (?/) + 1), . . . , a{ho z ^{y) + c) is flipped, and 

(ii) the £ + c-critical bit position a(£ + c + 1) is nipped. 
This yields 

Pr[Lo 2 , CT (y") > Lo z , a (y')] = (1 - i) c i > (1 - J)*"^ > (ek)- 1 , 
where we make use of the well-known fact that for all positive integers k it holds that (1— \) k ^ > 
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Algorithm 3: The {x, y)-encoded (1 + 1) evolutionary algorithm with mutation probabil- 
ity 1/k. 

1 Input: ^-encoding pair (x,y) G {0, l} n x {0, l} n . 

2 y' <r- y; 

3 while Lo 2i(7 (y') < Lo ZjCT (y) + k do 

4 y" <- random(y',x,y,l/fe); 

5 Query Lo 2j<7 (/); 

6 |_ if Lo 2j(T (y") > Lo z>a (y') then y' <- y"; 

7 Output y'; 



Lemma [2] motivates us to formulate Algorithm [3] which can be seen as a variant of the 
standard (1 + 1) EA, in which we fix some bit positions and where we apply a non-standard 
mutation probability. The variation operator random(y', x, y, 1/k) samples a bit string y" from 
y' by flipping each non-encoding bit position j G [n]\B(x, y) with probability 1/k. This is easily 
seen to be an unbiased variation operator of arity 3. 

The following statement follows easily from Lemma [2] and the linearity of expectation. 

Corollary 3. Let (x,y), £, and k be as in Lemma\^ Then the (x,y)- encoded (1 + 1) EA 
with mutation probability 1/k, after an expected number of 0(k 2 ) queries, outputs a bit string 
y' G {0, l} n with Lo z , (J (y / ) > £ + k. 

A second key argument in the proof of Theorem [1] is the following. Given an ^-encoding pair 
(x, y) and a bit string y' with Lo Z;CT (y / ) > £ + \/log n, we are able to learn the \/log n ^-critical 
bit positions o~{£+ 1), . . . , a(£ + \/\og n) in an expected number of O (log 3 / 2 (n)/ log log n) queries. 
This will be formalized in the following statements. 

Lemma 4. Let £ G [0..n— [\/log n \ ] and let (x, y) be an £-encoding pair for Lo 2j(J . Furthermore, 
let y' be a bit string with ho z ^ a (y') > £ + \/log n. 

For each i G [8e log 3 / 2 (n)/ log log n] let y % be sampled from y' by independently flipping each 
non-encoding bit position j G [n]\B(x,y) with probability 1/^/log n. 

For each c G [yTogn] let 

X c := {y i | i G [8e log 3 / 2 (n)/ log log n] and Lo Zj(T (y i ) =£ + c-l}, 

the set of all samples y l with Lo 2)CT (y J ) = £ + c — 1. 
Then 

Pr [Vc G [y/log n] : \X C \ > 41og(n)/loglogn] > 1 - o(l) . 

Proof. For readability purposes, let us abbreviate k := y4ogn. 

First, let us consider the size \X C \ for some fixed c G [k]. By the same arguments as used in 
the proof of Lemma [2] we have for any i G [8e log 3 / 2 (n) / log log n] that 

?r[Lo z ,M) =£ + c- !] = (!- I)- 1 ! > (1 - J J > ^ . 

Thus, E[|A C |] > 81og(n)/loglogn by the linearity of expectation. 

If we set Yi >c = 1 if ~Lo z ^(y l ) = £ + c — 1 and l^ jC = otherwise, we have \X C \ := 
^Selog/ (n)/loglogn y . c _ In particularj ig the 

sum of independent random variables. We can 
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thus apply Chernoff 's bounds (cf. [ADllj for a compact introduction) to bound the deviation 
of \X C \ from its expectation and obtain 

Pr[|X c | <§E[|X C |]] <exp(- §E[|X C |]) < exp ( - log(n)/loglogn) <l/logn, 

where the last inequality follows from log(n) / log log n > In log n for large enough n. By a simple 
union bound we conclude 

Pr [Vc G [ylog n] : \X C \ > 41og(n)/loglogn] > 1 — 1 / \/log n = 1 - o(l) . 

□ 

These sets X c are large enough to identify a(£ + c). 

Lemma 5. Let I, (x,y), and y' be as in Lemma^and let t := 41og(n)/loglogn. 

For any c G [\/log nj let X c be a set of at least t bit strings y 1 (c), . . . , y\ Xc -\(c) with fitness 
Lo Zj(7 (y J (c)) = l + c—1, which are sampled from y' by independently flipping each non-encoding 
bit position j G [n]\B(x,y) with probability l/ydogn. 

Then we have, with probability at least 1 — o{l), that for all c G [-v/log n\ there exists only one 
non-encoding j := ji +c G [n]\B(x, y) with y'j = 1 — J/J(c) for all i G [\X C \]. Clearly, j = a(£ + c). 

Proof. Let us first consider some fix value c G [ylogn]. For any i G [\X C \] we have, by definition, 
that Lo Zi(T (s/ , (c)) = l+c-1. Thus, it must hold that y^ +c) (c) = l-y^ +c) and y % ff ^(c) = y' a{j) 
for all j < l + c. Let 

l l+c := [n]\(B(x, y) U {a{£ + 1), . . . , a(£ + c)}) 
= [n]\{a(l),...,^ + c)}, 

Since all bit flips are mutually independent, we have for any i G [\X C \] and any fixed j G 2^ +c 
that the entry y*(c) in the j-th bit position equals 1 — y'j with probability 1/ylogn. Note that 
this remains true despite the fact that we condition on Lo^ )CT (y*) = £ + c— 1. 

Thus, for any j G 2^ +c we have 

Pr[Vi G [|X C |] : y)(c) = 1 - y'j] < (1/^1^)1**1 

< (i/v/Ioi^) 410g{n)/loslogn 

= 2- 21o § n = 1/n 2 . 

By the union bound, the probability that there exists a j G X^ +c with y* (c) = 1 — y'j for all 
i G [\X C \] is bounded from above by 1/n. 

And by again applying a union bound we find that 

Pr[3c G [ V / loi^]3j G X^i G [|X C |] : y){c) = 1 - $ < Tbg^/n = o(l) . 

□ 

Combining Lemma H] with Lemma [5] we immediately gain the following. 

Corollary 6. Let £, (x,y),y' , and y l , i = 1, . . . , 8elog 3 / 2 (n)/ log log n, be as in Lemma^ 

With probability at least 1 — o(l) we have that for all c G [i/log n\ there exists only one 
non-encoding j := ji +c G [n]\B(x,y) with y'- = 1 — y l - for all i G [8elog 3 / 2 (n)/loglogn] with 
Lo 2jCT (y i ) = I + c - 1. Clearly, j = o{i + c). 
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We are now ready to prove Theorem [TJ As mentioned above, the proof also shows that the 
statement remains correct if we consider the unbiased black-box model with arbitrary arity. 

of Theorem{J\ We need to show that there exists an algorithm which maximizes any (a priori 
unknown) function Lo 2jCT G LeadingOnes^ using, on average, 0(nlog(n)/loglogn) queries. 

For readability purposes, let us fix some function / = Lo 2)(T G LeadingOnes^ to be 
maximized by the algorithm. 

First, let us give a rough idea of our algorithm, Algorithm 01 A detailed analysis can be 
found below. 

The main idea is the following. We maximize / block-wise, where each block has a length 
of \/log n bits. Due to the influence of the permutation a on /, these bit positions are a 
priori unknown. Assume for the moment that we have an ^-encoding pair (x,y), where I G 
[0..n— [vTog n ] ] . In the beginning we have I = and y = x® (1, . . . , 1), the bitwise complement 
of x. To find an (£ + yTog~n)-encoding pair, we first create a string y' with objective value 
f(y') > ^ + \/log n. By Corollary [3l this requires on average O(logn) queries. Next, we need 
to identify the yTogn /(y)-critical bit positions a{£ + 1), . . . , o{l + yTogn). To this end, we 
sample enough bit strings such that we can unambiguously identify these bit positions. As we 
shall see, this requires on average 0(log 3 / 2 (n)/loglog n) queries. After identifying the critical 
bits, we update (x, y) to a [l + yl"ogra)-encoding pair. Since we need to optimize n/^\ogn such 
blocks of size ylogn, the overall expected optimization time is 0(n log (n)/ log log n). 

Let us now present a more detailed analysis. We start by querying two complementary bit 
strings x,y. By swapping x with y in case f(y) > f(x), we ensure that f(x) > f(y) = 0. This 
gives us a 0-encoding pair. 

Let an ^-encoding pair (x,y), for some fixed value I G [0..n — \y/log n]], be given. 
We show how from this we find an (I + \/log n)-encoding pair in an expected number of 
O (log 3 / 2 (n)/ log log n) queries. 

As mentioned above, we first find a bit string y' with objective value f(y') > I + \/log n. 
We do this by running Algorithm [3l the (x, y)-encoded (1 + 1) EA with mutation probabil- 
ity 1 / \/log n until we obtain such a bit string y' . By Corollary [3] this takes, on average, O(logn) 
queries. 

Next we want to identify the \J\og n ^-critical bit positions a(£+l), . . . , a(l+y/\og n). To this 
end, we query in the i-th iteration of the second phase, a bit string y 1 which has been created from 
y' by flipping each non-encoding bit y'-, j G [n]\B(x, y) independently with probability l/ydog n. 
If f(y l ) = l+c—1 for some c G [yJ\og n] , we update Xg +C ^— Xe +c L){y 1 }, the set of all queries with 
objective value t + c — 1, and we compute Ji+c := {j G [n]\B(x, y) \ \/w G X^ +c : wj = 1 — y'j}, 
the set of candidates for a {I + c). We do so until we find \J~e+ c \ = 1 f° r a fi c £ W^°S n \ ■ By 
Corollary [6] this takes, on average, at most 8e log 3 ^ 2 (n)/ log log n queries. 

Thus, all we need to do in the third step is to update (x, y) to an (£ + y/log n)-encoding pair 
by exploiting the information gathered in the second phase. For any c G [\/log n\ let us denote 
the element in Jz +C by je+ c - We go through the positions a(£ + 1), . . . , a(t + ^log n) one after 
the other and either we update y <— y e" +c (if f(y) < f(x)), and we update x <— x © e™ £+c 
otherwise. It is easy to verify that after y/\og n such steps we have f(x) > £ + y/logn and 
f(y) > £ + \/log n. It remains to swap (x, y) <— (y, x) in case f(y) > f(x) in order to obtain an 
{£ + \J\og n)-encoding pair (x, y). 

This shows how, given a ^-encoding pair (x,y), we find an (£ + y / Iogn)-encoding pair in 
O(logn) + 0(log 3 / 2 (n)/loglogn) + 0{yJ\ogn) = 0(log 3 / 2 (n)/loglogn) queries. 

By definition of Algorithm U all bit positions in B(x, y) remain untouched in all fur- 
ther iterations of the algorithm. Thus, in total, we need to optimize I , ,. n . I blocks 

L | V log n | J 

of size [ylogn] until we have a ([ , , n . j [v^og n 1 )-encoding pair (x,y). For each 
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Algorithm 4: A *-ary unbiased black-box algorithm for maximizing / G LEADlNGONES n . 
Recall that we have defined Jt+ C '■= {j G [n]\B(x, y) \ Vu> G Xg +C : Wj = 1 — y'j}. 

1 Initialization: 

2 for i = 1, . . . , n do Xi <— 0; 

3 Sample x G {0, l} n uniformly at random; 

4 Query f(x); 

5 Set y-G-x© (1,...,1); 

6 Query f(y); 

7 if f(y) > f(x) then <- (y,x); 

8 Optimization: 

9 while \B(x,y)\< [ |Vlo " gra| j fVbg^l do 
^|S(x,y)|; 

Apply Algorithm [3] with input (x,y) and mutation probability 1/ylogn until it 
outputs a bit string y' with f(y') > I + Vlog n; 
Initialize « -<— 1; 

, > 1 do 

,i 



10 
11 

12 
13 
14 
15 
16 
17 
18 

19 

20 
21 



while 3c G [y/log n\ : | 

y l G- random(y', x, y, l/Vbgn); 
Query 



if /<y ) G . . . , I + V\ogH - 1] then 



Update Xj(yi^ + i 
Update Jffyty, 



X 



U{y 4 }; 



for c = 1, . . . , V^Iogn do update(x, y, y' , Xi +C 
if f(y) > fix) then (x,y) <r- (y,x); 



22 Apply Algorithm [3] with input (x, y) and mutation probability 1 / \/log n until it queries 
for the first time a string y' with f{y') = n; 



block, the expected number of queries needed to fix the corresponding bit positions is 
0(log 3//2 (n)/loglogn). By linearity of expectation this yields a total expected optimization 
time of 0(n/ \/log n) O (log 3 / 2 (n) / log log n) = 0(nlog(n)/loglogn) for optimizing the first 
k := L ^Jgni J rVlog^l bit positions <r(l), . . . , a(k). 

The remaining n — k < \_y/\ogn\ bit positions can be found by Algorithm [3] in an expected 
number of O(logn) queries (Corollary [3]). This does not change the asymptotic number of 
queries needed to identify z. 

Putting everything together, we have shown that AlgorithmH]optimizes any function Lo ZtlJ G 
LEADiNGONES ra in an expected number of 0(nlog(n)/loglogn) queries. It is not difficult to 
verify that all variation operators are unbiased. We omit the details. □ 



4 The Unbiased Black-Box Complexity of LeadingOnes n 

Next we show how a slight modification of Algorithm [5] yields a 3-ary unbiased black-box 
algorithm with the same asymptotic expected optimization time. 

Theorem 7. The 3-ary unbiased black-box complexity of LeadingOnes„ is 
0(n log(n) / log log n) . 
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Algorithm 5: Subroutine update(x, y, y' , X< 



1 Input: An ^-encoding pair (x, y), a bit string y' with f(y') > i + \/log n, and, a set X"^ 
of samples w with /(w) = £ + c — 1 such that 

|^r +c | = \{j G [n]\fi(x,y) | € X c : ^ = 1- = 1; 

2 if /(y) < /(s) then 

3 
4 

5 else 

6 



Query /(y); 



7 



Query /(z); 



Proof. Key for this result is the fact that, instead of storing for any c G [ylogn] the whole query 
history Xg +C , we need to store only one additional bit string x l+c to keep all the information 
needed to determine a(£ + c). 

Algorithm [6] gives the full algorithm. Here, the bit string update2(u>, y', x l+c ) is defined 
via (update2(ii;, y' , x e+c )) . = Wi if i G [n]\£>(y', a/ +c ) and (update2(w, y', x £+c )) . = 1 — u>j for 
i e B{y',x e+C ). 

Note that, throughout the run of the algorithm, the pair (y',x i+c ), or more precisely, the 
set B(y',x e+C ) encodes which bit positions j are still possible to equal a(£ + c). Expressing 
the latter in the notation used in the proof of Theorem [TJ we have in any iteration of the first 
while-loop that for all i G [n] it holds y\ = xf +c if and only if % G Si+ C - This can be seen as 
follows. In the beginning, we only know that o~(l+c) ^ B(x, y). Thus, we initialize a^ +c <— 1—y'i 
if i G B{x,y) and a^ +c y[ for i G [n]\B(x,y). In each iteration of the second while-loop, we 
update xl +c <— 1 — y[ if a(£ + c) = i can no longer hold, i.e., if we have sampled a bit string w 
with /(to) = £ + c — 1 and W{ = y\. 

It is easily verified that Algorithm [fj] certifies Theorem [71 We omit a full proof in this 
extended abstract. □ 



5 LeadingOnes n in the Ranking-Based Models 

As discussed above, we introduced two ranking-based versions of the black-box complexity 
notion in |DW11| : the unbiased ranking-based and the unrestricted ranking-based black-box 
complexity. Instead of querying the absolute fitness values /(x), in the ranking-based model, 
the algorithms may only query the ranking of y among all previously queried search points, 
cf. [DW11| for motivation and formal definitions. We briefly remark the following. 

Theorem 8. The 3-ary unbiased ranking-based black-box complexity of LeadingOnes„ is 
0(n log(n) / log log n) . 

This theorem immediately implies that the unrestricted ranking-based black-box complexity 
of LEADlNGONES n is 0(n log(n)/ log log n) as well. 

Theorem [8] can be proven by combining the Algorithm [6] presented in the proof of Theorem [JJ 
with a sampling strategy as used in Lemma HI Although the latter is not optimal, it suffices to 
show that after sampling 0(log 3 ^ 2 (n)/loglogn) such samples, we can identify the rankings of 
f{£ + 1), . . . , f(£ + y/log n), with probability at least 1 — o(l). We do the sampling right after 
Line [10] of Algorithm [6l After having identified the rankings of f(£ + 1), . . . , f(£ + ylog n), we 
can continue as in Algorithm [6j 
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Algorithm 6: A 3-ary unbiased black-box algorithm for maximizing / G LEADlNGONES n . 

1 Initialization: 

2 Sample x G {0, 1}™ uniformly at random; 

3 Query /(x); 

4 Set y <<- x® (1, . . . ,1); 

5 Query f(y); 

6 if f(y) > f(x) then (x,y) <(- (y,x); 

7 Optimization: 

s while < LrvrfcrJ fv^l do 

9 
10 



n 

12 
13 

14 
15 
16 
17 
18 



19 
20 



21 



£<-\B{x,y)\; 

Apply Algorithm [3] with input (x,y) and mutation probability l/ylogn until it 
outputs a bit string y' with f(y') > i + ylogn; 
for c = 1, . . . , ^/log n do 
for z = 1, . . . , n do 
|_ if i G £>(x, y) then x^ +c «— 1 — ^ else x^ +c «— 



while 3c G [-y/log nj : |£>(x £+c , > 1 do 
w <— random(y / , x, y, 1/^/logn); 
Query f( w); 

if 3c G [Vlog n] : f(w) = I + c — 1 then 
|_ for i = 1, . . . , n do if xf +c = y' i = w i then xf +c <(— 1 - y-; 



for c = 1 , . . . , \/log n do 
|^ if /(y) < /(x) then update2(y, y', x e+c ) else update2(x, y', x e+c ); 

if /(y) > f(x) then (x,y) <- (y,x); 

22 Apply Algorithm [3] with input (x, y) and mutation probability l/i/Iogn until it queries 
for the first time a string y' with f{y') = n; 



6 Conclusions 

We have shown that there exists a 3-ary unbiased black-box algorithm which optimizes any 
function Lo 2)(T G LeadingOnes„ in an expected number of 0(nlog(n)/loglogn) queries. This 
establishes a new upper bound on the unrestricted and the 3-ary unbiased black-box complexity 

of LEADINGONES n . 

Our result raises several questions for future research. The obvious one is to close the 
gap between the currently best lower bound of Q(n) (cf. |DJW06j ) and our upper bound of 
0(nlog(n)/loglogn). Currently, we cannot even prove an u:(n) lower bound. Secondly, it 
would also be interesting to know whether the gap between the 2-ary and the 3-ary unbiased 
black-box model is an artifact of our analysis or whether 3- and higher arity operators are truly 
more powerful than binary ones. 
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